12. Reinforcement Learning
C01 L01 A11 Reinforcement Learning
You can read about reinforcement learning gone awry in Microsoft's "Tay" Twitter bot in this article.
- Human in the Loop (HITL) refers to having a human-moderator or data annotator that can help with quality control of a product.